SPECIFICATION

Process for Obtaining DNA, RNA, Peptides,
Polypeptides, or Proteins by Recombinant DNA
Technique



The present invention has as its object a process 
for obtaining DNA, RNA, peptides, polypeptides or 
proteins, through use of transformed host cells
containing genes capable of expressing these RNAs,
peptides, polypeptides, or proteins; that is to say, by
utilisation of recombinant DNA technique. 

The invention aims in particular at the production 
of stochastic genes or fragments of stochastic genes
in a fashion to permit obtaining simultaneously,
after transcription and translation of these genes, a
very large number (on the order of at least 10,000) of
completely new proteins, in the presence of host
cells (bacterial or eucaryotic) containing these genes
respectively capable of expressing these proteins,
and to carry out thereafter a selection or screen
among the said clones, in order to determine which
of them produce proteins with desired properties,
for example structural, enzymatic, catalytic,
antigenic, pharmacologic, or properties of
liganding, and more generally, chemical,
biochemical, biological, etc. properties.

The invention also has as its aim procedures to
obtain sequences of DNA or RNA with utilizable
properties, notably chemical, biochemical, or
biological properties.

It is clear, therefore, that the invention is open to a
very large number of applications in very many
areas of science, industry and medicine.
The process for production of peptides or
polypeptides according to the invention is
characterized in that one produces simultaneously,
in the same medium, genes which are at least
partially composed of synthetic stochastic
polynucleotides, that one introduces the genes thus
obtained into host cells, that one cultivates
simultaneously the independent clones of the
transformed host cells containing these genes in
such a manner as to clone the stochastic genes
and to obtain the production of the proteins
expressed by each of these stochastic genes, that
one carries out selection and/or screening of the
clones of transformed host cells in a manner to
identify those clones producing peptides or
polypeptides having at least one desired activity,
that one thereafter isolates the clones thus identified
and that one cultivates them to produce at least one
peptide or polypeptide having the said property.

In a first mode of carrying out this process,
stochastic genes are produced by stochastic
copolymerization of the four kinds of
deoxyphosphonucleotides, A, C, G and T, from the
two ends of an initially linearized expression vector,
followed by formation of cohesive ends in such a
fashion as to form a stochastic first strand of DNA
constituted by a molecule of expression vector
possessing two stochastic sequences whose 3' ends
are complementary, followed by the synthesis of the
second strand of the stochastic DNA.
In a second mode of carrying out this process,
stochastic genes are produced by copolymerization
of oligonucleotides without cohesive ends, in a
manner to form fragments of stochastic DNA,
followed by ligation of these fragments to a
previously linearized expression vector.
The expression vector can be a plasmid, notably a
bacterial plasmid. Excellent results have been
obtained using the plasmid pUC8 as the expression
vector.

The expression vector can also be viral DNA or a
hybrid of plasmid and viral DNA.

The host cells can be prokaryotic cells such as
HB101 and C600, or eukaryotic cells.

When utilizing the procedure according to the
second mode mentioned above, it is possible to
utilize oligonucleotides which form a group of
palindromic octamers.

Particularly good results are obtained by utilizing
the following group of palindromic octamers:



5' GGAATTCC 3'
5' GGTCGACC 3'
5' CAAGCTTG 3'
5' CCATATGG 3'
5' CATCGATG 3'

It is also possible to use oligonucleotides which
form a group of palindromic heptamers.

Very good results are obtained utilizing the
following group of palindromic heptamers:

5' XTCGCGA 3'
5' XCTGCAG 3'
5' RGGTACC 3'

where X = A, G, C or T, and R = A or T.
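The palindromic character of the oligonucleotides listed above can be checked mechanically. The following is a minimal sketch in Python, assuming that "palindromic" means equal to its own reverse complement, and that for the heptamers the palindrome is the six-base core following the variable first base (X or R). The function and variable names are illustrative and do not come from the specification.

```python
# Watson-Crick complement of each DNA base.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def is_palindromic(seq: str) -> bool:
    """A DNA palindrome reads the same 5' to 3' on both strands."""
    return seq == reverse_complement(seq)

# Octamers and heptamer patterns as listed in the text.
OCTAMERS = ["GGAATTCC", "GGTCGACC", "CAAGCTTG", "CCATATGG", "CATCGATG"]
HEPTAMER_PATTERNS = ["XTCGCGA", "XCTGCAG", "RGGTACC"]

# Meaning of the variable first base, as defined in the text
# (note that R here is A or T, per the text).
VARIABLE_BASES = {"X": "AGCT", "R": "AT"}

def expand(pattern: str) -> list[str]:
    """List the concrete heptamers covered by one pattern."""
    first, core = pattern[0], pattern[1:]
    return [base + core for base in VARIABLE_BASES[first]]

octamers_ok = all(is_palindromic(o) for o in OCTAMERS)
# Assumption: only the six-base core of each heptamer is palindromic;
# the first base is the single unpaired 5' base.
heptamer_cores_ok = all(is_palindromic(p[1:]) for p in HEPTAMER_PATTERNS)
all_heptamers = [h for p in HEPTAMER_PATTERNS for h in expand(p)]

print(octamers_ok, heptamer_cores_ok, len(all_heptamers))
```

Under these assumptions all five octamers and all three heptamer cores pass the test, and the three patterns expand to ten concrete heptamers.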

According to a method to utilize these procedures
which is particularly advantageous, one isolates and
purifies the transforming DNA of the plasmids from
a culture of independent clones of the transformed
host cells obtained by following the procedures
above, then the purified DNA is cut by at least one
restriction enzyme corresponding to a specific
enzymatic cutting site present in the palindromic
octamers or heptamers but absent from the
expression vector which was utilized; this cutting is
followed by inactivation of the restriction enzyme,
then one simultaneously treats the ensemble of
linearized stochastic DNA fragments thus obtained
with T4 DNA ligase, in such a manner as to create a
new ensemble of DNA containing new stochastic
sequences, and this new ensemble can therefore contain
a number of stochastic genes larger than the
number of genes in the initial ensemble. One then
utilizes this new ensemble of transforming DNA to
transform the host cells and clone these genes, and
finally utilizes screening and/or selection and
isolates the new clones of transformed host cells,
and finally these are cultivated to produce at least
one peptide or polypeptide, for example, a new
protein.

The property serving as the criterion for selection
of the clones of host cells can be the capacity of the
peptides or polypeptides, produced by a given
clone, to catalyse a given chemical reaction.
For instance, for the production of several
peptides and/or polypeptides, the said property can
be the capacity to catalyse a sequence of reactions
leading from an initial group of chemical
compounds to at least one target compound.
With the aim of producing an ensemble
constituted by a plurality of peptides and
polypeptides which are reflexively autocatalytic, the
said property can be the capacity to catalyse the
synthesis of the same ensemble from amino acids
and/or oligopeptides in an appropriate milieu.
The said property can also be the capacity to
modify selectively the biological or chemical
properties of a given compound, for example the
capacity to selectively modify the catalytic activity of
a polypeptide.

The said property can also be the capacity to
stimulate, inhibit, or modify at least one biological
function of at least one biologically active
compound, chosen, for example, among the
hormones, neurotransmitters, adhesion factors,
growth factors and specific regulators of DNA
replication and/or transcription and/or translation of
RNA.

The said property can equally be the capacity of
the peptide or polypeptide to bind to a given ligand.

The invention also has as its object the use of the
peptide or polypeptide obtained by the process
specified above, for the detection and/or the titration
of a ligand.

According to a particularly advantageous mode of
carrying out the invention, the criterion for selection
of the clones of transformed host cells is the
capacity of these peptides or polypeptides to
simulate or modify the effects of a biologically
active molecule, for example, a protein, and
screening and/or selection for clones of transformed
host cells producing at least one peptide or
polypeptide having this property is carried out by
preparing antibodies against the active molecule,
then utilizing these antibodies, after their
purification, to identify the clones containing this
peptide or polypeptide, then by cultivating the
clones thus identified, separating and purifying the
peptide or polypeptide produced by these clones,
and finally by submitting the peptide or polypeptide
to an in vitro assay to verify that it has the capacity
to simulate or modify the effects of the said
molecule.

According to another mode of carrying out the
process according to the invention, the property
serving as the criterion of selection is that of having
at least one epitope similar to one of the epitopes of
a given antigen.

The invention carries over to obtaining
polypeptides by the process specified above and
utilizable as chemotherapeutically active
substances.

In particular, in the case where the said antigen is
EGF, the invention permits obtaining polypeptides
usable for chemotherapeutic treatment of
epitheliomas.

According to a variant of the procedure, one
identifies and isolates the clones of transformed
host cells producing peptides or polypeptides
having the property desired, by affinity
chromatography against antibodies corresponding
to a protein expressed by the natural part of the
DNA hybrid.

For example, in the case where the natural part of
the hybrid DNA contains a gene expressing
β-galactosidase, one can advantageously identify and
isolate the said clones of transformed host cells by
affinity chromatography against anti-β-galactosidase
antibodies.
After expression and purification of hybrid
peptides or polypeptides, one can separate and
isolate their novel parts.

The invention also applies to a use of the process
specified above for the preparation of a vaccine; the
application is characterized by the fact that
antibodies against the pathogenic agent are
isolated, for example antibodies formed after
injection of the pathogenic agent in the body of an
animal capable of forming antibodies against this
agent, and these antibodies are used to identify the
clones producing at least one protein having at least
one epitope similar to one of the epitopes of the
pathogenic agent, the transformed host cells
corresponding to these clones are cultured to
produce these proteins, this protein is isolated and
purified from the clones of cells, then this protein is
used for the production of a vaccine against the
pathogenic agent.
For example, in order to prepare an anti-HVB
vaccine, one can extract and purify at least one
capsid protein of the HVB virus, inject this protein
into an animal capable of forming antibodies
against this protein, recover these antibodies and use
them to identify the clones producing at least one
protein having at least one epitope similar to one of
the epitopes of the HVB virus, then
cultivate the clones of transformed host cells
corresponding to these clones in a manner to
produce this protein, isolate and purify the protein
from culture of these clones of cells and utilize the
protein for the production of an anti-HVB vaccine.
According to an advantageous mode of carrying
out the process according to the invention, the host
cells consist in bacteria such as Escherichia coli
whose genome contains neither the natural gene
expressing β-galactosidase, nor the EBG gene, that
is to say, Z⁻, EBG⁻ E. coli. The transformed cells are
cultured in the presence of X-gal and the inducer
IPTG in the medium, and cells positive for
β-galactosidase functions are detected; thereafter, the
transforming DNA is transplanted into an
appropriate clone of host cells for large scale culture
to produce at least one peptide or polypeptide.

The property serving as the criterion for selection
of the transformed host cells can also be the
capacity of the polypeptides or proteins produced
by the culture of these clones to bind to a given
compound.

This compound can be in particular chosen
advantageously among peptides, polypeptides and
proteins, notably among proteins regulating the
transcription activity of DNA.

On the other hand, the said compound can also be
chosen among DNA and RNA sequences.

The invention has also as its object those proteins
which are obtained in the case where the property
serving as criterion of selection of the clones of



GB 2 183 661 A 3



transformed host cells consists in the capacity of
these proteins to bind to regulatory proteins
controlling transcription activity of the DNA, or else
to DNA and RNA sequences.
The invention has, in addition, as an object the
use of a protein which is obtained in the first
particular case above mentioned, as a cis regulatory
sequence controlling replication or transcription of a
neighboring DNA sequence.
On the other hand, the aim of the invention also
includes utilization of proteins obtained in the
second case mentioned to modify the properties of
transcription or replication of a sequence of DNA, in
a cell containing the sequence of DNA, and
expressing this protein.

The invention has as its object as well a process of
production of DNA, characterized by simultaneous
production in the same medium, of genes at least
partially composed of stochastic synthetic
polynucleotides, in that the genes thus obtained are
introduced into host cells to produce an ensemble of
transformed host cells, in that screening and/or
selection on this ensemble is carried out to identify
those host cells containing in their genome
stochastic sequences of DNA having at least one
desired property, and finally, in that the DNA from
the clones of host cells thus identified is isolated.

The invention further has as its object a procedure
to produce RNA, characterized by simultaneous
production in the same medium, of genes at least
partially composed of stochastic synthetic
polynucleotides, in that the genes thus obtained are
introduced into host cells to produce an ensemble of
transformed host cells, in that the host cells so
produced are cultivated simultaneously, and
screening and/or selection of this ensemble is
carried out in a manner to identify those host cells
containing stochastic sequences of RNA having at
least one desired property, and in that the RNA is
isolated from the host cells thus identified.

The said property can be the capacity to bind a 
given compound, which might be for example a 
peptide or polypeptide or protein, or also the 
capacity to catalyse a given chemical reaction, or the 
capacity to be a transfer RNA.

Now the process according to the invention will
be described in more detail, as well as some of its
applications, with reference to non-limitative
embodiments.

First, we shall describe particularly useful
procedures to carry out the synthesis of stochastic
genes, and the introduction of those genes in
bacteria to produce clones of transformed bacteria.
I) Direct Synthesis on an Expression Vector
a) Linearization of the Vector

30 micrograms, that is, approximately 10¹³
molecules of the pUC8 expression vector, are
linearized by incubation for 2 hours at 37°C with 100
units of the PstI restriction enzyme in a volume of
300 µl of the appropriate standard buffer. The
linearized vector is treated with phenol-chloroform
then precipitated in ethanol, taken up in a volume of
30 µl and loaded onto a 0.8% agarose gel in standard
TEB buffer. After migration in a field of 3 V/cm for
three hours, the linearized vector is electro-eluted,
precipitated in ethanol, and taken up in 30 µl of water.
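The relation between the mass of vector and the number of vector molecules in step a) can be checked with a short calculation. The sketch below is illustrative only: it assumes a pUC8 length of 2665 base pairs and an average mass of 650 g/mol per base pair, neither of which is stated in the specification.

```python
AVOGADRO = 6.022e23          # molecules per mole
GRAMS_PER_MOL_PER_BP = 650.0 # assumed average for double stranded DNA
PUC8_LENGTH_BP = 2665        # assumed length of pUC8, not given in the text

def dna_molecules(mass_ug: float, length_bp: int) -> float:
    """Number of double stranded DNA molecules in a given mass (micrograms)."""
    molar_mass = length_bp * GRAMS_PER_MOL_PER_BP  # g/mol
    moles = mass_ug * 1e-6 / molar_mass
    return moles * AVOGADRO

vector_molecules = dna_molecules(30.0, PUC8_LENGTH_BP)
print(f"{vector_molecules:.2e} molecules in 30 micrograms of vector")
```

With these assumed constants, 30 micrograms of vector corresponds to roughly 10¹³ molecules, each offering two 3' ends for the synthesis described in the next step.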

b) Stochastic Synthesis Using the Enzyme Terminal
Transferase (TdT)

30 µg of the linearized vector are reacted for 2
hours at 37°C with 30 units of TdT in 300 µl of the
appropriate buffer, in the presence of 1 mM dGTP, 1
mM dCTP, 0.3 mM dTTP and 1 mM dATP. The lower
concentration of dTTP is chosen in order to reduce
the frequency of "stop" codons in the
corresponding messenger RNA. A similar result,
although somewhat less favorable, can be obtained
by utilizing a lower concentration for dATP than for
the other deoxynucleotide triphosphates. The
progress of the polymerization on the 3' extremity
of the PstI sites is followed by analysis on a gel of
aliquots taken during the course of the reaction.
When the reaction attains or passes a mean value
of 300 nucleotides added per 3' extremity, it is
stopped and the free nucleotides are separated from
the polymer by differential precipitation or by
passage over a column containing a molecular sieve
such as Biogel P60. After concentration by
precipitation in ethanol, the polymers are
subjected to a further polymerization with TdT, first
in the presence of dATP, then in the presence of
dTTP. These last two reactions are separated by a
filtration on a gel and are carried out for short
intervals (30 seconds to 3 minutes) in order to add
sequentially 10-30 A followed by 10-30 T to the 3'
ends of the polymers.
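The effect of the lowered dTTP concentration on the frequency of "stop" codons can be estimated with a simple probabilistic sketch. It rests on two assumptions that are not stated in the specification: that TdT incorporates each nucleotide in proportion to its concentration, and that the synthesized strand has the same sense as the messenger RNA (so that the stop codons UAA, UAG and UGA appear as TAA, TAG and TGA).

```python
# DNA-sense equivalents of the mRNA stop codons UAA, UAG and UGA.
STOP_CODONS = ("TAA", "TAG", "TGA")

def base_frequencies(conc_mM: dict[str, float]) -> dict[str, float]:
    """Assumed model: incorporation frequency proportional to concentration."""
    total = sum(conc_mM.values())
    return {base: conc / total for base, conc in conc_mM.items()}

def stop_codon_probability(freq: dict[str, float]) -> float:
    """Probability that a random triplet of independent bases is a stop codon."""
    return sum(freq[c[0]] * freq[c[1]] * freq[c[2]] for c in STOP_CODONS)

# Equal concentrations versus the mixture given in the text (0.3 mM dTTP).
equal_mix = base_frequencies({"A": 1.0, "C": 1.0, "G": 1.0, "T": 1.0})
low_dttp_mix = base_frequencies({"A": 1.0, "C": 1.0, "G": 1.0, "T": 0.3})

p_equal = stop_codon_probability(equal_mix)
p_low_dttp = stop_codon_probability(low_dttp_mix)

print(f"equal mix:    {p_equal:.4f} per triplet")
print(f"0.3 mM dTTP:  {p_low_dttp:.4f} per triplet")
```

In this simplified model the chance that a given triplet is a stop codon falls from about 4.7% to about 2.5%, so the mean length of an open reading frame is roughly doubled. The real incorporation rates of TdT may differ from the concentration ratios assumed here.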

c) Synthesis of the Second Strand of Stochastic DNA
Each molecule of vector possesses, at the end of
the preceding operation, two stochastic sequences
whose 3' ends are complementary. The mixture of
polymers is therefore incubated in conditions
favoring hybridization of the complementary
extremities (150 mM NaCl, 10 mM Tris-HCl, pH 7.6, 1
mM EDTA at 65°C for 10 minutes, followed by
lowering the temperature to 22°C at a rate of 3 to
4°C per hour). The hybridized polymers are then
reacted with 60 units of the large fragment (Klenow)
of polymerase I, in the presence of the four
nucleotide triphosphates (200 mM) at 4°C for two
hours. This step accomplishes the synthesis of the
second strand from the 3' ends of the hybrid
polymers. The molecules which result from this
direct synthesis starting from linearized vector are
thereafter utilized to transform competent cells.



d) Transformation of Competent Clones
100 to 200 ml of competent HB101 or C600 at a
concentration of 10¹⁰ cells/ml, are incubated with
the stochastic DNA preparation (from above) in the
presence of 6 mM CaCl2, 6 mM Tris-HCl pH 8, 6 mM
MgCl2 for 30 minutes at 0°C. A temperature shock of
3 minutes at 37°C is imposed on the mixture,
followed by the addition of 400 to 800 ml of NZY
culture medium, without antibiotics. The
transformed culture is incubated at 37°C for 60
minutes, then diluted to 10 litres by addition of NZY
medium containing 40 µg/ml ampicillin. After
3-5 hours of incubation at 37°C, the amplified
culture is centrifuged, and the pellet of transformed
cells is lyophilised and stored at -70°C. Such a
culture contains 3×10⁷ to 10⁸ independent
transformants, each containing a unique stochastic
gene inserted into the expression vector.

II) Synthesis of Stochastic Genes Starting From
Oligonucleotides Without Cohesive Ends
This procedure is based on the fact that
polymerization of judiciously chosen palindromic
oligonucleotides permits construction of stochastic
genes which have no "stop" codon in any of the six
possible reading frames, while at the same time
assuring a balanced representation of triplets
specifying all amino acids. Further, and to avoid a
repetition of sequence motifs in the proteins which
result, the oligonucleotides can contain a number of
bases which is not a multiple of three. The example
which follows describes the use of one of the
possible combinations which fulfil these criteria:

a) Choice of a Group of Octamers
The following group of oligonucleotides:

5' GGAATTCC 3'
5' GGTCGACC 3'
5' CAAGCTTG 3'
5' CCATATGG 3'
5' CATCGATG 3'

is composed of 5 palindromes (thus self-
complementary sequences) where it is easy to verify
that their stochastic polymerization does not
generate any "stop" codons and specifies all the
amino acids.

Obviously, one can utilize other groups of
palindromic octamers which do not generate any
"stop" codons and specify all the amino acids found
in polypeptides. Clearly, it is also possible to utilize
non-palindromic groups of octamers, or other
oligomers, under the condition that their
complements forming double stranded DNA are
also used.
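The statement that polymerization of the five octamers generates no "stop" codon in any of the six reading frames can be verified by enumeration. The sketch below is one possible way to do so and is not part of the specification. It relies on the observation that, since each unit is longer than two bases, every triplet of a polymer lies within a pair of adjacent units, so it suffices to scan every ordered pair of units on both strands.

```python
from itertools import product

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}
STOP_CODONS = {"TAA", "TAG", "TGA"}  # DNA-sense form of UAA, UAG, UGA
OCTAMERS = ["GGAATTCC", "GGTCGACC", "CAAGCTTG", "CCATATGG", "CATCGATG"]

def reverse_complement(seq: str) -> str:
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def triplets(seq: str) -> set[str]:
    """All triplets at every offset, which covers the three frames of a strand."""
    return {seq[i:i + 3] for i in range(len(seq) - 2)}

def stop_codons_in_polymers(units: list[str]) -> set[str]:
    """Stop codons that can occur in any polymer built from the given units.

    Scanning every ordered pair of units on both strands covers all six
    reading frames of every possible polymer.
    """
    found = set()
    for left, right in product(units, repeat=2):
        joined = left + right
        for strand in (joined, reverse_complement(joined)):
            found |= triplets(strand) & STOP_CODONS
    return found

print(stop_codons_in_polymers(OCTAMERS))      # octamers from the text
print(stop_codons_in_polymers(["TTAATTAA"]))  # hypothetical counter-example
```

For the five octamers of the text the returned set is empty, while the hypothetical palindrome TTAATTAA, included only as a control, yields the stop codon TAA.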

b) Assembly of a Stochastic Gene From a Group of
Octamers

A mixture containing 5 µg each of the
oligonucleotides indicated above (previously
phosphorylated at the 5' position by a standard
procedure) is reacted in a 100 µl volume containing
1 mM ATP, 10% polyethyleneglycol, and 100 units
of T4 DNA ligase in the appropriate buffer at 13°C for
six hours. This step carries out the stochastic
polymerization of the oligomers in the double
stranded state and without cohesive ends. The
resulting polymers are isolated by passage over a
molecular sieve (Biogel P60), recovering those with
20 to 100 oligomers. After concentration, this
fraction is again submitted to catalysis of
polymerization by T4 DNA ligase under the
conditions described above. Thereafter, as
described above, those polymers which have
assembled at least 100 oligomers are isolated.
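The assembly above joins octamers, whose length is not a multiple of three. The following sketch illustrates why this avoids a repetition of sequence motifs in the resulting proteins: the reading frame in which each successive unit is read shifts along the polymer. The function name is illustrative only.

```python
def unit_frames(unit_length: int, n_units: int) -> list[int]:
    """Reading-frame offset (0, 1 or 2) at which each successive unit starts.

    The i-th unit of a polymer starts at base i * unit_length, so its
    offset relative to the codon grid is that position modulo 3.
    """
    return [(i * unit_length) % 3 for i in range(n_units)]

print("octamers: ", unit_frames(8, 6))
print("heptamers:", unit_frames(7, 6))
print("nonamers: ", unit_frames(9, 6))
```

With units of 8 or 7 bases the offset cycles through all three frames, so the same oligonucleotide is translated differently depending on its position. With a hypothetical unit of 9 bases the offset would always be zero, and each unit would always encode the same three amino acids.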

c) Preparation Of The Host Plasmid

The pUC8 expression vector is linearized by SmaI
enzyme in the appropriate buffer, as described
above. The vector linearized by SmaI does not have
cohesive ends. Thus the linearized vector is treated
by calf intestine alkaline phosphatase (CIP) at a level
of one unit per microgram of vector in the
appropriate buffer, at 37°C for 30 minutes. The CIP
enzyme is thereafter inactivated by two successive
extractions with phenol-chloroform. The linearized
and dephosphorylated vector is precipitated in
ethanol, then redissolved in water at 1 mg/ml.

d) Ligation of Stochastic Genes To The Vector

Equimolar quantities of vector and polymers are
mixed and incubated in the presence of 1000 units
of T4 DNA ligase, 1 mM ATP, 10% polyethylene
glycol, in the appropriate buffer, for 12 hours at
13°C. This step ligates the stochastic polymers in the
expression vector and forms double stranded
circular molecules which are, therefore, capable of
transforming.

e) Transformation of Competent Clones
Transformation of competent clones is carried out
in the manner previously described.
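The equimolar condition of the ligation step can be translated into masses. In the sketch below, equal numbers of molecules imply that the masses of polymer and vector stand in the ratio of their lengths. The vector length of 2665 base pairs is an assumption not given in the specification, and the polymer length of 800 base pairs corresponds to the minimum of 100 octamers mentioned above.

```python
def equimolar_insert_mass_ug(vector_mass_ug: float,
                             vector_bp: int,
                             insert_bp: int) -> float:
    """Mass of insert containing as many molecules as the given vector mass.

    For double stranded DNA the mass per molecule is proportional to its
    length, so the equimolar mass ratio equals the length ratio.
    """
    return vector_mass_ug * insert_bp / vector_bp

PUC8_LENGTH_BP = 2665      # assumed length of pUC8, not given in the text
POLYMER_LENGTH_BP = 100 * 8  # 100 octamers, the minimum size retained

polymer_mass = equimolar_insert_mass_ug(1.0, PUC8_LENGTH_BP, POLYMER_LENGTH_BP)
print(f"{polymer_mass:.2f} micrograms of polymer per microgram of vector")
```

Under these assumptions about 0.3 micrograms of polymers of 100 octamers match one microgram of vector. Longer polymers require proportionally more mass.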

III) Assembly of Stochastic Genes Starting From A
Group Of Heptamers
This procedure differs from that just discussed in
that it utilizes palindromic heptamers which have
variable cohesive ends, in place of stochastic
sequences containing a smaller number of identical
motifs.

a) Choice of a Group of Heptamers

It is possible, as an example, to use the following
three palindromic heptamers:

5' XTCGCGA 3'
5' XCTGCAG 3'
5' RGGTACC 3'

where X = A, G, C, or T and R = A or T, and where
polymerization cannot generate any "stop" codons
and forms triplets specifying all the amino acids.

Clearly it is possible to use other groups of
heptamers fulfilling these same conditions.

b) Polymerization of a Group of Heptamers

This polymerization is carried out exactly in the 
fashion described above for octamers. 

c) Elimination Of Cohesive Extremities
The polymers thus obtained have one unpaired
base on their two 5' extremities; thus, it is
necessary to add the complementary base to the
corresponding 3' extremities. This is carried out as
follows: 10 micrograms of the double stranded
polymers are reacted with 10 units of the Klenow
enzyme, in the presence of the four
deoxynucleotide triphosphates (200 mM) in a volume
of 100 µl, at 4°C, for 60 minutes. The enzyme is
inactivated by phenol-chloroform extraction, and
the polymers are cleansed of the residual free
nucleotides by differential precipitation. The
polymers are then ligated to the host plasmid
(previously linearized and dephosphorylated)
following the procedures described above.
It is to be noted that the two last procedures which
were described utilize palindromic octamers or
heptamers which constitute specific sites of certain
restriction enzymes. These sites are absent, for the
most part, from the pUC8 expression vector. Thus, it
is possible to augment considerably the complexity
of an initial preparation of stochastic genes by
proceeding in the following way: the plasmid DNA
derived from the culture of 10[exponent illegible] independent
transformants obtained by one of the two last
procedures described above, is isolated. After this
DNA is purified, it is partially digested by the [illegible]
restriction enzyme (procedure II) or by the PstI
restriction enzyme (procedure III). After inactivation
of the enzyme, the partially digested DNA is treated with
T4 DNA ligase, which has the effect of creating a
[illegible] the fundamental properties [illegible]
sequences. This new ensemble of stochastic
sequences can then be used to transform competent
cells. In addition, the stochastic genes cloned by
procedures II and III can be excised intact from the
expression vector by utilizing restriction sites
belonging to the cloning vector and not represented
in the stochastic DNA sequences.

Recombination within the stochastic genes
generated by the two procedures just described,
which results from the internal homology due to the
recurrent molecular motifs, is an important
additional method to achieve in vivo mutagenesis of
the coding sequences. This results in an
augmentation of the number of new genes which
can be examined.
Finally, for all the procedures to generate novel
synthetic genes, it is possible to use a number of
common techniques to modify genes in vivo or in
vitro, such as a change of reading frame, inversion
of sequences with respect to their promoter, point
mutations, or utilization of host cells expressing one
or several suppressor tRNAs.

In considering the above description, it is clear
that it is possible to construct, in vitro, an extremely
large number (for example greater than a billion) of
different genes, by enzymatic polymerization of
nucleotides or of oligonucleotides. This
polymerization is carried out in a stochastic manner,
as determined by the respective concentrations of
the nucleotides or oligonucleotides present in the
reaction mixture.
As indicated above, two methods can be utilized
to clone such genes (or coding sequences): the
polymerization can be carried out directly on a
cloning expression vector, which was previously
linearized; or it is possible to proceed sequentially
to the polymerization then the ligation of the
polymers to the expression vector.

In the two cases, the next step is transformation or
transfection of competent bacterial cells (or cells in
culture). This step constitutes cloning the stochastic
genes in living cells where they are indefinitely
propagated and expressed.



Clearly, in addition to the procedures which were
described above, it is feasible to use all other
methods which are appropriate for the synthesis of
stochastic sequences. In particular, it is possible to
carry out polymerization, by biochemical means, of
single stranded oligomers of DNA or RNA obtained
by chemical synthesis, then treat these segments of
DNA or RNA by established procedures to generate
double stranded DNA (cDNA) in order to clone such
genes.



Screening Or Selection Of Clones Of Transformed
Host Cells

The further step of the procedure according to the
invention consists in examining the transformed or
transfected cells by selection or screening, in order
to isolate one or several cells whose transforming or
transfecting DNA leads to the synthesis of a
transcription product (RNA) or translation product
(protein) having desired properties. These properties
can be, for example, enzymatic, functional, or
structural.
One of the most important aspects of the process
according to the invention is that it permits the
simultaneous screening or selection of an
exploitable product (RNA or protein) and the gene
which produces that product. In addition, the DNA
synthesized and cloned as described can be
selected or screened in order to isolate sequences of
DNA constituting products in themselves, having
exploitable biochemical properties.
We shall now describe, as non-limiting
examples, preferred procedures for screening or
selection of clones of transformed cells such that the
novel proteins are of interest from the point of view
of industrial or medical applications.

One of these procedures rests on the idea of
producing, or obtaining, polyclonal or monoclonal
antibodies, by established techniques, directed
against a protein or another type of molecule of
biochemical or medical interest, where that
molecule is, or has been rendered, immunogenic,
and thereafter using these antibodies as probes to
identify among the very large number of clones
transformed by stochastic genes, those whose
protein reacts with these antibodies. This reaction is
a result of a structural homology which exists
between the polypeptide synthesized by the
stochastic gene and the initial molecule. It is
possible in this way to isolate novel
proteins which behave as epitopes or antigenic
determinants on the initial molecule. Such novel
proteins are liable to simulate, modulate,
or block the effect of the initial molecule. It will be
clear that this means of selection or screening may
itself have very many pharmacological and
biochemical applications. Below we describe, as a
non-limiting example, this first mode of operation in
a specific case.
EGF (epidermal growth factor) is a small protein
present in the blood, whose role is to stimulate the
growth of epithelial cells. This effect is obtained by
the interaction of EGF with a specific receptor
situated in the membrane of epithelial cells.
Antibodies directed against EGF are prepared by
injecting animals with EGF coupled to KLH (keyhole
limpet hemocyanin) to augment the
immunogenicity of the EGF. The anti-EGF
antibodies of the immunized animals are purified,
for example, by passage over an affinity column,
where the ligand is EGF or a synthetic peptide
corresponding to a fragment of EGF. The purified
anti-EGF antibodies are then used as probes to
screen a large number of bacterial clones lysed by
chloroform, and on a solid support. The anti-EGF
antibodies bind those stochastic peptides or
proteins whose epitopes resemble those of the
initial antigen. The clones containing such peptides
or proteins are shown by autoradiography after
incubation of the solid support with radioactive
protein A, or after incubation with a radioactive anti-
antibody antibody.

These steps identify those clones, each of which
contains one protein (and its gene) reacting with the
screening antibody. It is feasible to screen among a
very large number of colonies of bacterial cells or
viral plaques (for example, on the order of
1,000,000) and it is feasible to detect extremely
small quantities, on the order of 1 nanogram, of
protein product. Thereafter, the identified clones are
cultured and the proteins so detected are purified in
conventional ways. These proteins are tested in
vitro in cultures of epithelial cells to determine if
they inhibit, simulate, or modulate the effects of EGF
on these cultures. Among the proteins so obtained,
some may be utilized for the chemotherapeutic
treatment of epitheliomas. The activities of the
proteins thus obtained can be improved by
mutation of the DNA coding for the proteins, in
ways analogous to those described above. A variant
of this procedure consists in purifying these
stochastic peptides, polypeptides or proteins, which
can be used as vaccines or more generally to confer
an immunity against a pathogenic agent or to
exercise other effects on the immunological system,
for example, to create a tolerance or diminish
hypersensitivity with respect to a given antigen, in
particular due to binding of these peptides,
polypeptides or proteins with the antibodies
directed against this antigen. It is clear that it is
possible to use such peptides, polypeptides or
proteins in vitro as well as in vivo.

More precisely, in the ensemble of novel proteins
which react with the antibodies against a given
antigen X, each has at least one epitope in common
with X, thus the ensemble has an ensemble of
epitopes in common with X. This permits utilization
of the ensemble or sub-ensemble as a vaccine to
confer immunity against X. It is, for example, easy to
purify one or several of the capsid proteins of the
hepatitis B virus. These proteins can then be
injected into an animal, for example, a rabbit, and
the antibodies corresponding to the initial antigen
can be recovered by affinity column purification.
These antibodies may be used, as described above,
to identify clones producing at least one protein
having an epitope resembling at least one of the
epitopes of the initial antigen. After purification,
these proteins are used as antigens (either alone or
in combination) with the aim of conferring
protection against hepatitis B. The final production
of the vaccine does not require further access to the
initial pathogenic agent.
Note that, during the description of the
procedures above, a number of means to achieve
selection or screening have been described. All
these procedures may require the purification of a
particular protein from a transformed clone. These
protein purifications can be carried out by
established procedures and utilize, in particular, the
techniques of gel chromatography, ion exchange
chromatography, and affinity chromatography. In
addition, the proteins generated by the stochastic
genes can have been cloned in the form of hybrid
proteins having, for example, a sequence of the
β-galactosidase enzyme which permits affinity
chromatography against anti-β-galactosidase
antibodies, and allows the subsequent cleavage of
the hybrid part (that is to say, allowing separation of
the novel part and the bacterial part of the hybrid
protein).

Below we describe the principles and
procedures for selection of peptides or polypeptides
and the corresponding genes, according to a second
method of screening or selection based on the
detection of the capacity of these peptides or
polypeptides to catalyse a specific reaction.
As a concrete and non-limiting example,
screening or selection in the particular case of
proteins capable of catalysing the cleavage of
lactose, normally a function fulfilled by the enzyme
β-galactosidase (β-gal), will be described.

As described above, the first step of the process
consists in generating a very large ensemble of
expression vectors, each expressing a distinct novel
protein. To be concrete, for example, one may
choose the pUC8 expression vector with cloning of
stochastic sequences of DNA in the PstI restriction
site. The plasmids thus obtained are then
introduced in a clone of E. coli from whose genome
the natural gene for β-galactosidase, Z, and a
second gene EBG, unrelated to the first but able to
mutate towards β-gal function, have both been
eliminated by known genetic methods. Such host
cells (Z⁻, EBG⁻) are not able by themselves to
catalyse lactose hydrolysis, and as a consequence to
use lactose as carbon source for growth. This
permits utilization of such host clones for screening
or selection for β-gal function.
A convenient biological assay to analyse
transformed E. coli clones for those which have
novel genes expressing a β-gal function consists in
the culture of bacteria transformed as described in
Petri dishes containing X-gal in the medium. In this
case, all bacterial colonies expressing a β-gal
function are visualized as blue colonies. By using
such a biological assay, it is possible to detect even
weak catalytic activity. The specific activity of
characteristic enzymes ranges from 10 to 10,000
product molecules per second.

Supposing that a protein synthesized by a
stochastic gene has a weak specific activity, on the
order of one molecule per 100 seconds, it remains
possible to detect such catalytic activity. In a Petri
dish containing X-gal in the medium, and in the
presence of the non-metabolizable inducer IPTG,
visualization of a blue region requires cleavage of
about 10^10 to 10^11 molecules of X-gal per square
millimeter. A bacterial colony expressing a weak
enzyme and occupying a surface area of 1 mm² has
about 10^7 to 10^8 cells. If each cell has only one
copy of the weak enzyme, each cell would need to
catalyse cleavage of between 10,000 and 100
molecules of X-gal to be detected, which would
require between 2.7 and 270 hours. Since under
selective conditions it is possible to amplify the
number of copies of each plasmid per cell to 5 to 20
copies per cell, or even to 100 to 1000, and because
up to 10% of the protein of the cell can be specified
by the new gene, the duration needed to detect a
blue colony in the case of 100 enzyme molecules of
weak activity per cell is on the order of [...]
In consequence of these facts, screening a very
large number of independent bacterial colonies,
each expressing a different novel [...]
carry out screening of about 2000 colonies in one
Petri dish of 10 cm diameter. Thus, about 20 million
colonies can be screened on a sheet of X-gal agar of
[...]
It is to be noted that bacterial colonies which
appear blue on X-gal Petri dishes might be false
positives due to a mutation in the bacterial genome
which confers on it the capacity to metabolize
lactose, or for other reasons than those which result
from a catalytic activity of the novel protein
expressed by the cells of the colony. Such false
positives can be directly eliminated by purifying the
DNA of the expression vector from the positive
colony, and retransforming Z⁻, EBG⁻ E. coli host
cells. If the β-gal activity is due to the novel protein
coded by the new gene in the expression vector, all
those cells transformed by that vector will exhibit
β-gal function. In contrast, if the β-gal function is
due to a mutation in the genome of the host cell, it is
a rare event and independent of the transformation;
thus the number of cells of the new clone of
transformed E. coli capable of expressing β-gal
function will be small or zero.

The power of mass simultaneous purification of
all the expression vectors from all the positive
(blue) clones, followed by retransformation of naive
bacteria, should be stressed. Suppose that the aim is
to carry out a screening to select proteins having a
catalytic function, and that the probability that a new
peptide or polypeptide carries out this function at
least weakly is 10^-6, while the probability that a
clone of the E. coli bacterial host undergoes a
mutation rendering it capable of carrying out the
same function is 10^-5. Then it can be calculated that
among 20 million transformed bacteria which are
screened, 20 positive clones will be attributable to
the novel genes in the expression vectors which each
carries, while 200 positive clones will be the result of
genomic mutation. Mass purification of the
expression vectors from the total of 220 positive
bacterial clones, followed by retransformation of
naive bacteria with the mixture of these expression
vectors, will produce a large number of positive
clones consisting of all those bacteria transformed
with the 20 expression vectors which code for the
novel proteins having the desired function, and a
very small number of bacterial clones resulting from
genomic mutations and containing the 200
expression vectors which are not of interest. A small
number of cycles of purification of expression
vectors from positive bacterial colonies, followed by
retransformation, allows the detection of very
rare expression vectors truly positive for a desired
catalytic activity, despite a high background rate of
mutations in the host cells for the same function.
Following screening operations of this type, it is
possible to purify the new protein by established
techniques. The production of that protein in large
quantity is made possible by the fact that
identification of the useful protein occurs together
with simultaneous identification of the gene coding
for the same protein. Consequently, either the same
expression vector can be used, or the novel gene
can be transplanted into a more appropriate
expression vector for its synthesis and isolation.

It is feasible to apply this method of screening for
any enzymatic function for which an appropriate
biological assay exists. For such screenings, it is not
necessary that the enzymatic function which is
sought be useful to the host cell. It is possible to
carry out screenings not only for an enzymatic
function but for any other desired property for
which it is possible to establish an appropriate
biological assay. It is thus feasible to carry out, even
in the simple case of β-gal function visualized on an
X-gal Petri plate, a screening of on the order of 100
million, or even a billion novel genes for a catalytic
activity or any other desired property.



Selection Of Transformed Host Cells 
On the other hand, it is possible to use selection
techniques for any property, catalytic or otherwise,
where the presence or absence of the property can
be rendered essential for the survival of the host
cells containing the expression vectors which code
for the novel genes, or also can be used to select for
those viruses coding and expressing the desired
novel gene. As a non-limiting, but concrete
example, selection for β-galactosidase function
shall be described. An appropriate clone of Z⁻ EBG⁻
E. coli is not able to grow on lactose as the sole
carbon source. Thus, after carrying out the first step
described above, it is possible to culture a very large
number of host cells transformed by the expression
vectors coding for the novel genes, under selective
conditions, either by progressive diminution of
other sources of carbon, or utilization of lactose
alone from the start. During the course of such
selection, in vivo mutagenesis by recombination, or
by explicitly recovering the expression vectors and
mutagenizing their novel genes in vitro by various
mutagens, or by any other common technique,
permits adaptive improvements in the capacity to
fulfil the desired catalytic function. When both
selection techniques and convenient bioassay
techniques exist at the same time, as in the present
case, it is possible to use selection techniques



B 



GB 2 183 661 A 8



initially to enrich the representation of host bacteria
expressing the β-gal function, then carry out a
screening on a Petri plate on X-gal medium to
establish efficiently which are the positive cells. In
the absence of convenient bioassays, application of
progressively stricter selection is the easiest route to
purify one or a small number of distinct host cells
whose expression vectors code for the proteins
catalysing the desired reaction.

It is possible to utilize these techniques to find
novel proteins having a large variety of structural
and functional characteristics beyond the capacity
to catalyse a specific reaction. For example, it is
possible to carry out a screen or select for novel
proteins which bind to cis-regulatory sites on the
DNA and thereby block the expression of one of the
host cell's functions, or block transcription of the
DNA, stimulate transcription, etc.

For example, in the case of E. coli, a clone mutant
in the repressor of the lactose operon (i⁻) expresses
β-gal function constitutively due to the fact that the
lactose operator is not repressed. All cells of this
type produce blue clones on Petri plates containing
X-gal medium. It is possible to transform such host
cells with expression vectors synthesizing novel
proteins and carry out a screen on X-gal Petri plates
in order to detect those clones which are not blue.
Among those, some represent the case where the
new protein binds to the lactose operator and
represses the synthesis of β-gal. It is then feasible to
mass isolate such plasmids, retransform, isolate
those clones which do not produce β-gal, and
thereafter carry out a detailed verification.

As mentioned above, the process can be utilized
in order to create, then isolate, not only exploitable
proteins, but also RNA and DNA as products in
themselves, having exploitable properties. This
results from the fact that, on one hand, the
procedure consists in creating stochastic sequences
of DNA which may interact directly with other
cellular or biochemical constituents, and on the
other hand, these sequences cloned in expression
vectors are transcribed into RNA which are
themselves capable of multiple biochemical
interactions.

An Example Of The Use Of The Procedure To Create 
And Select For A DNA Which Is Useful In Itself. 
This example illustrates selection for a useful
DNA, and the purification and study of the
mechanism of action of regulatory proteins which
bind to the DNA. Consider a preparation of the
oestradiol receptor, a protein obtained by standard
techniques. In the presence of oestradiol, a steroid
sexual hormone, the receptor changes
conformation and binds tightly to certain specific
sequences in the genomic DNA, thus affecting the
transcription of genes implicated in sexual
differentiation and the control of fertility. By
incubating a mixture containing oestradiol, its
receptor, and a large number of different stochastic
DNA sequences inserted in their vectors, followed
by filtration of the mixture across a nitrocellulose
membrane, one has a direct selection for those
stochastic DNA sequences binding to the
oestrogen-receptor complex, where only those DNAs
bound to a protein are retained by the membrane. After
washing and elution, the DNA liberated from the
membrane is utilized as such to transform bacteria.
After culture of the transformed bacteria, the vectors
which they contain are again purified and one or
several cycles of incubation, filtration and
transformation are carried out as described above.
These procedures allow the isolation of stochastic
sequences of DNA having an elevated affinity for the
oestradiol-receptor complex. Such sequences are
open to numerous diagnostic and pharmacologic
applications, in particular, for developing synthetic
oestrogens for the control of fertility and treatment
of sterility.

Creation And Selection Of An RNA Useful In Itself
Let there be a large number of stochastic DNA
sequences, produced as has been described and
cloned in an expression vector. It follows that the
RNA transcribed from these sequences in the
transformed host cells can be useful products
themselves. As a non-limiting example, it is possible
to select a stochastic gene coding for a suppressor
transfer RNA (tRNA) by the following procedure: A
large number (≥ 10^[...]) of stochastic sequences are
transformed into competent bacterial hosts carrying
a "nonsense" mutation in the argE gene. These
transformed bacteria are plated on minimal medium
without arginine and with the selective antibiotic for
that plasmid (ampicillin if the vector is pUC8). Only
those transformed bacteria which have become
capable of synthesizing arginine will be able to
grow. This phenotype can result either from a back
mutation of the host genome, or the presence in the
cell of a suppressor. It is easy to test each
transformed colony to determine if the Arg+
phenotype is or is not due to the presence of the
stochastic gene in its vector; it suffices to purify the
plasmid from this colony and verify that it confers
an Arg+ phenotype on all argE cells transformed
by it.

Selection Of Proteins Capable Of Catalysing A
Sequence Of Reactions

Below we describe another means of selection,
open to independent applications, based on the
principle of simultaneous and parallel selection of a
certain number of novel proteins capable of
catalysing a connected sequence of reactions.

The basic idea of this method is the following:
given an initial ensemble of chemical compounds
considered as building blocks or elements of
construction from which it is hoped to synthesize
one or several desired chemical compounds by
means of a catalysed sequence of chemical
reactions, there exists a very large number of
reaction routes which can be partially or completely
substituted for one another, which are all
thermodynamically possible, and which lead from
the set of building blocks to the desired target
compound(s). Efficient synthesis of a target
compound is favored if each step of at least one
reaction pathway leading from the building block
compounds to the target compound is comprised of



[...] previous description, we have sho[wn] [...]
stochastic proteins is present in a [...]
containing the building block compounds [...]
a sufficiently large nu[mber] [...]

[...] specify the desired set of compounds [...]
containing the building blocks [...]
Preferentially a reason[able] [...]
formed. If it is, [...] proteins. If so,
[...] the presence of proteins [...]
then the mixture should co[ntain] [...]

[...] hydrolysis and occurs efficiently [...]
an appropriate enzymatic [...]
pepsin or trypsin, without requiring t[he] [...]
ATP or other high energy [...]
reaction mixture of small [...]
high concentrations to lead to
condensation reactions. [...]
faster and simpler to carry out. [...]

[...] building blocks. [...] in the conditions
described above, [...]

The next step consists in the separation of
the particular subset of cells which contain
expression vectors with the novel proteins
catalysing the sequence of reactions leading to the
target pentapeptide. As an example, if the number
of reactions forming this sequence is 5, there are
about 5 novel proteins which catalyse the necessary
reactions. If the clone bank of bacteria containing
the expression vectors which code for the novel
genes has a number of distinct novel genes which is
on the order of 1,000,000, all these expression
vectors are isolated en masse and retransformed
into 100 distinct sets of 10^[...] bacteria at a ratio of
vectors to bacteria which is sufficiently low that, on
average, the number of bacteria in each set which
are transformed is about half the number of initial
genes, i.e. about 500,000. Thus, the probability that
any given one of the 100 sets of bacteria contains
the entire set of 5 critical novel proteins is
(1/2)^5 = 1/32. Among the 100 initial sets of bacteria,
about 3 will contain the 5 critical transformants. In
each of these sets, the total number of new genes is
only 500,000 rather than 1,000,000. By successive
repetitions, the total number of which is about 20 in
the present case, this procedure isolates the 5
critical novel genes. Following this, mutagenesis
and selection on this set of 5 stochastic genes allows
improvement of the necessary catalytic functions. In
a case where it is necessary to catalyse a sequence
of 20 reactions and 20 genes coding novel proteins
need to be isolated in parallel, it suffices to adjust
the multiplicity of transformation such that each set
of 10^[...] bacteria receives 80% of the 10^6 stochastic
genes, and to use 200 such sets of bacteria. The
probability that all 20 novel genes are found in
one set is (0.8)^20 ≈ 0.012. Thus, about 2 among the 200
sets will have the 20 novel genes which are needed
to catalyse the formation of the target compound.
The number of cycles required for isolation of the 20
novel genes is on the order of 30.
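The probabilities quoted above follow from treating the presence of each critical gene in a set as an independent event. The sketch below reproduces that arithmetic; it is an illustration only, the function names are hypothetical, and the cycle count simply asks how many successive halvings bring a library of 1,000,000 genes down to the order of a single gene.

```python
# Arithmetic behind the sequential diminution of the library
# (illustrative sketch; names and the independence assumption are ours).
import math


def prob_set_complete(fraction, n_genes):
    """Probability that one set holds all n_genes critical genes when each
    gene is present independently with probability `fraction`."""
    return fraction ** n_genes


def expected_complete_sets(n_sets, fraction, n_genes):
    """Expected number of sets that contain every critical gene."""
    return n_sets * prob_set_complete(fraction, n_genes)


def cycles_to_reduce(library_size, fraction, final_size=1):
    """Number of cycles needed to shrink the library to final_size when each
    cycle keeps `fraction` of the genes."""
    return math.ceil(math.log(library_size / final_size) / math.log(1 / fraction))


if __name__ == "__main__":
    print(prob_set_complete(0.5, 5))            # 0.03125, i.e. 1/32
    print(expected_complete_sets(100, 0.5, 5))  # 3.125, "about 3" sets
    print(cycles_to_reduce(1_000_000, 0.5))     # 20 cycles
    print(round(expected_complete_sets(200, 0.8, 20), 2))  # about 2 sets
```

The first three values match the figures given for the 5-gene case; the last confirms that about 2 of 200 sets are expected to hold all 20 genes when each set receives 80% of the library.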

The principles and procedures described above
generalize from the case of peptides to numerous
areas of chemistry in which chemical reactions take
place in aqueous medium, in temperature, pH, and
concentration conditions which permit general
enzymatic function. In each case it is necessary to
make use of an assay method to detect the
formation of the desired target compound(s). It is
also necessary to choose a sufficiently large number
of building block compounds to augment the
number of reaction sequences which lead to the
target compound.

The concrete example which was given for the
synthesis of a target pentapeptide can also be
generalized as follows:

The procedure as described generates, among
other products, stochastic peptides and proteins.
These peptides or proteins can act, catalytically or in
other ways, on other compounds. They can equally
constitute the substrates on which they act. Thus, it
is possible to select (or screen) for the capacity of
such stochastic peptides or proteins to interact
among themselves and thereby modify the
conformation, the structure or the function of some
among them. Similarly, it is possible to select (or
screen) for the capacity of these peptides and
proteins to catalyse among themselves hydrolysis,
condensation, transpeptidation or other reactions
modifying the peptides. For example, the hydrolysis
of a given stochastic peptide by at least one member
of the set of stochastic peptides and proteins can be
followed and measured by radioactive marking of
the given protein followed by an incubation with a
mixture of the stochastic proteins in the presence of
ions such as Mg, Ca, Zn, Fe and ATP or GTP. The
appearance of radioactive fragments of the marked
protein is then measured as described. The
stochastic protein(s) which catalyse this reaction
can again be isolated, along with the gene(s)
producing them, by sequential diminution of the
library of transformed clones, as described above.
An extension of the procedure consists in the
selection of an ensemble of stochastic peptides and
polypeptides capable of catalysing a set of reactions
leading from the initial building blocks (amino acids
and small peptides) to some of the peptides or
polypeptides of the set. It is therefore also possible
to select an ensemble capable of catalysing its own
synthesis; such a reflexively autocatalytic set can be
established in a chemostat where the products of
the reactions are constantly diluted, but where the
concentration of the building blocks is maintained
constant. Alternatively, synthesis of such a set is
aided by enclosing the complex set of peptides in
liposomes by standard techniques. In a hypertonic
aqueous environment surrounding such liposomes,
condensation reactions forming larger peptides
lower the osmotic pressure inside the liposomes,
drive water molecules produced by the
condensation reactions out of the liposomes, and hence
favor synthesis of larger polymers. Existence of
such an autocatalytic ensemble can be verified by
two dimensional gel electrophoresis and by HPLC,
showing the synthesis of a stable distribution of
peptides and polypeptides. The appropriate reaction
volume depends on the number of molecular
species used, and the concentrations necessary to
favor the formation of peptide bonds over their
hydrolysis. The distribution of molecular species of
an autocatalytic ensemble is free to vary or change
due to the emergence of variant autocatalytic
ensembles. The peptides and polypeptides which
constitute an autocatalytic set may have certain
elements in common with the large initial ensemble
(constituted of coded peptides and polypeptides as
given by our procedure) but can also contain
peptides and polypeptides which are not coded by
the ensemble of stochastic genes coding for the
initial ensemble.

The set of stochastic genes whose products are
necessary to establish such an autocatalytic set can
be isolated as has been described, by sequential
diminution of the library of transformed clones. In
addition, an autocatalytic set can contain coded
peptides initially coded by the stochastic genes and
synthesized continuously in the autocatalytic set. To
isolate this coded subset of peptides and proteins,
the autocatalytic set can be used to obtain, through
immunization in an animal, polyclonal sera
recognizing a very large number of constituents of
the autocatalytic set.

These sera can be utilized to screen the library of
stochastic genes to find those genes expressing
proteins able to combine with the antibodies
present in the sera.

This set of stochastic genes expresses a large
number of coded stochastic proteins which persist
in the autocatalytic set. The remainder of the coded
constituents of such an autocatalytic set can be
isolated by serial diminution of the library of
stochastic genes, from which the subset detected by
immunological methods has first been subtracted.

Such autocatalytic sets of peptides and proteins,
obtained as noted, may find a number of practical
applications.



1. Process for the production of peptides or
polypeptides by microbiological means,
characterized in that genes which are at least
partially composed of stochastic synthetic
polynucleotides are produced simultaneously in a
common medium, that genes thus obtained are
introduced into host cells, that the independent
clones of the transformed host cells containing
these genes are simultaneously cultivated so as to
clone the stochastic genes and lead to the
production of proteins expressed by each of these
stochastic genes, that screening and/or selection is
carried out on such clones of transformed host cells
to identify those clones producing peptides or
polypeptides having at least one specified property,
that the clones so identified are isolated, then grown
in a manner so as to produce at least one peptide or
polypeptide having the said property.

2. Process according to claim 1, characterized by
the fact that the genes are produced by stochastic
copolymerization of the four types of
deoxyphosphonucleotides A, C, G and T, starting
from the two extremities of an expression vector
which was previously linearized, then by formation
of cohesive extremities to create a first strand of
stochastic DNA constituted of a molecule of
expression vector possessing two stochastic
sequences whose 3' extremities are
complementary, followed by synthesis of the
second strand of the stochastic DNA.
3. Process according to claim 1, characterized by
the fact that the genes are produced by stochastic
copolymerization of double stranded
oligonucleotides which do not have cohesive ends,
in a manner so as to form fragments of stochastic
DNA, followed by ligation of those fragments in an
expression vector which was previously linearized.

4. Process according to claim 2 or 3, characterized
by the fact that the expression vector is a plasmid.

5. Process according to claim 4, characterized by
the fact that the expression vector is pUC8.

6. Process according to claim 2 or claim 3,
characterized by the fact that the expression vector is a
fragment of viral DNA.

7. Process according to claim 2 or claim 3,
characterized by the fact that the expression vector
is a hybrid of plasmid and viral DNA.

8. Process according to claims 1 to 6, characterized
by the fact that the host cells are prokaryotic cells.

9. Process according to claims 1 to [...], characterized
by the fact that the host cells are eukaryotic cells.



10. Process according to claim 8, characterized by
the fact that the cells are chosen among HB101 and
[...]

11. Process according to claim 3, characterized by
the fact that the oligonucleotides form a group of
palindromic octamers.

12. Process according to claim 11, characterized by
the fact that the group of palindromic octamers is
the following group:

5' GGAATTCC 3'
5' GGTCGACC 3'
5' CAAGCTTG 3'
5' CCATATGG 3'
5' CATCGATG 3'
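The palindromic character of the octamers listed above can be checked mechanically: a sequence is palindromic in the molecular-biology sense when it equals its own reverse complement. The sketch below is an illustration and not part of the claims. It reads the last listed octamer as CATCGATG, and the table of enzyme recognition sequences (EcoRI, SalI, HindIII, NdeI, ClaI) is standard reference knowledge added here as an assumption; the claim itself names no enzymes.

```python
# Check that each octamer equals its reverse complement and find which
# well-known 6-base restriction site it contains (illustrative sketch).

COMPLEMENT = str.maketrans("ACGT", "TGCA")

OCTAMERS = ["GGAATTCC", "GGTCGACC", "CAAGCTTG", "CCATATGG", "CATCGATG"]

# Standard recognition sequences; assumed here, not stated in the claim.
SITES = {
    "EcoRI": "GAATTC",
    "SalI": "GTCGAC",
    "HindIII": "AAGCTT",
    "NdeI": "CATATG",
    "ClaI": "ATCGAT",
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def is_palindromic(seq):
    """True when the sequence reads the same on both strands."""
    return seq == reverse_complement(seq)


def sites_in(seq):
    """Names of the listed restriction sites found inside the sequence."""
    return [name for name, site in SITES.items() if site in seq]


if __name__ == "__main__":
    for octamer in OCTAMERS:
        print(octamer, is_palindromic(octamer), sites_in(octamer))
```

Each octamer passes the palindrome test and carries exactly one of the listed hexamer sites, which is what allows a polymer of such octamers to be recut with the corresponding enzyme.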






13. Process according to claim 3, characterized by
the fact that the oligonucleotides form a group of
palindromic heptamers.

14. Process according to claim 13, characterized
by the fact that the group of palindromic heptamers
is the following group:

5' XTCGCGA 3'
5' XCTGCAG 3'
5' RGGTACC 3'

where X = A, G, C, or T, and R = A or T.
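The degenerate heptamers above stand for small families of concrete sequences. The sketch below expands them using the letter definitions given in the claim (X for any base, R for A or T); note that this use of R differs from the IUPAC convention, where R denotes A or G. The names `CLAIM_CODES` and `expand` are hypothetical, and the heptamers are read as XTCGCGA, XCTGCAG and RGGTACC.

```python
# Expand the degenerate heptamers into concrete sequences and check that the
# six bases after the degenerate position are palindromic (illustrative sketch).
from itertools import product

# Letter meanings as defined in the claim, not the IUPAC ambiguity codes.
CLAIM_CODES = {"X": "AGCT", "R": "AT"}

HEPTAMERS = ["XTCGCGA", "XCTGCAG", "RGGTACC"]

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def expand(pattern):
    """List every concrete sequence matched by a degenerate pattern."""
    choices = [CLAIM_CODES.get(ch, ch) for ch in pattern]
    return ["".join(combo) for combo in product(*choices)]


def is_palindromic(seq):
    """True when the sequence equals its reverse complement."""
    return seq == seq.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    for pattern in HEPTAMERS:
        core = pattern[1:]  # the six fixed bases after the degenerate position
        print(pattern, expand(pattern), "core palindromic:", is_palindromic(core))
```

The three patterns expand to ten sequences in total, and in each case it is the six-base core, not the full heptamer, that is strictly self-complementary.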

15. Process according to claim 4 and one of
claims 12 or 14, characterized by the fact that one first
isolates and purifies the transforming DNA derived
from a culture of independent clones of transformed
host cells obtained by proceeding in the manner
specified in claim 11 or in claim 13, then that one
cuts the purified DNA by means of at least one
restriction enzyme which corresponds to a specific
restriction site present in these palindromic
octamers or heptamers, but absent from the
expression vector being utilized, that one thereafter
simultaneously treats the ensemble of linearized
stochastic DNA fragments so obtained by T4 DNA
ligase in such a manner as to create a new ensemble
of DNA containing new stochastic sequences, and
that one uses this new ensemble of transforming
DNA to transform host cells and clone such genes,
and finally that one carries out screening and/or
selection and isolates the new clones of
transformed cells and that one grows these so as to
produce at least one peptide or polypeptide having
a desired property.

16. Process according to claim 1, characterized by
the fact that the said property is the capacity to
catalyse a given chemical reaction.

17. Process according to claim 1 for the
production of several peptides and/or polypeptides,
characterized by the fact that the said property is the
capacity to catalyse a sequence of reactions leading
from a given group of initial chemical compounds to
at least one target compound.

18. Process according to claim 16 for the
production of an ensemble consisting of more than
one peptide and/or polypeptide which is reflexively
autocatalytic, characterized by the fact that the said
property is the capacity to catalyse the synthesis of
the ensemble itself starting from amino acids and/or
oligopeptides.



19. Process according to claim 1, characterized by
the fact that the said property is the capacity to
modify selectively the chemical and/or biological
properties of a given compound.

20. Process according to claim 19, characterized
by the fact that the said property is the capacity to
modify selectively the catalytic activity of a
polypeptide.

21. Process according to claim 19, characterized
by the fact that the said property is the capacity to
simulate or modify at least one biological function
of at least one biologically active compound.

22. Process according to claim 21, characterized
by the fact that the said biologically active
compound is chosen among the hormones,
neurotransmitters, adhesion or growth factors, and
specific regulators of replication and/or
transcription of DNA, and/or translation of RNA.

23. Process according to claim 1, characterized by
the fact that the said property is the capacity to bind
to a given ligand.

24. Utilization of a peptide or polypeptide
obtained by the process according to claim 23 for
the detection and/or titration of the ligand.

25. Process according to claim 1, characterized by
the fact that the property is to have at least one
epitope similar to one of the epitopes of a given
antigen.

26. Process according to claims 1 and 25,
characterized by the fact that the said property is the
capacity to simulate or modify the effects of a
biologically active molecule, and that screening and/
or selection of the clones of transformed host cells
producing at least one peptide or polypeptide
having this property is carried out by preparing
antibodies against that molecule, and utilizing these
antibodies so obtained to identify those clones
containing those peptides or polypeptides, then by
growing the clones thus identified and separating
and purifying the peptide or polypeptide produced
by these clones, and finally by submitting these
peptide(s) or polypeptide(s) to an assay in vitro to
verify that it has in fact the capacity to simulate or
modify the effects of the said molecule.

27. Peptides or polypeptides obtained by the
process according to claim 1 or claim 26, utilizable
as active substances having a pharmacologic and/or 
chemotherapeutic action. 

28. Peptides or polypeptides, obtained by the
process according to claim 25, utilizable to diminish,
in vitro or in vivo, the concentration of free
antibodies, specific for the said antigen, by
formation of bonds between these peptides or
polypeptides and these antibodies.

29. Peptides or polypeptides according to claims
27 or 28, utilizable as agents to suppress
immunological hypersensitivity.

30. Peptides or polypeptides, obtained by the 
process according to claim 25, utilizable as agents to
create tolerance with respect to the said antigen.

31. Process according to claim 25, characterized
by the fact that the antigen is EGF. 

32. Peptides or polypeptides obtained by the
process according to claim 31, utilizable for the
chemotherapeutic treatment of epitheliomas.



33. Process according to claim 1, characterized by 
the fact that clones of transformed host cells 
producing peptides or polypeptides having the 
desired property are identified and isolated by
affinity chromatography on antibodies 
corresponding to a protein expressed by the natural 
fragment of the DNA hybrid. 

34. Process according to claim 33, characterized 
by the fact that the natural fragment of the DNA
hybrid contains a gene expressing β-galactosidase,
and that one identifies and isolates the said clones
of transformed cell hosts by affinity
chromatography with anti-β-galactosidase
antibodies. 

35. Process according to claim 1, or claim 34, 
characterized by the fact that after expression and 
purification of the hybrid peptides or polypeptides, 
the novel fragments are separated and isolated. 

36. Application of the process according to claim
25 or claim 26, for the preparation of a vaccine,
characterized by the fact that antibodies against a
pathogenic agent are obtained and used to identify
those clones producing at least one protein having
at least one epitope similar to one of the epitopes of
the pathogenic agent, that the corresponding clones
of transformed host cells are grown in such a
manner as to produce this protein, that the protein is
isolated and purified from the cultures of clones of
cells and that this protein is used for the production
of a vaccine against the pathogenic agent.

37. Application according to claim 36, for the 
preparation of an anti-HVB vaccine, characterized by 
the fact that at least one capsid protein of the HVB 
virus is extracted and purified, that this protein is
injected into the body of an animal capable of
forming antibodies against this protein, that these
antibodies are recovered and purified, that these
antibodies are used to identify those clones
producing at least one protein having at least one
epitope similar to one of the epitopes of the HVB
virus, that the clones of transformed host cells
corresponding to these clones are grown in a
manner to produce this protein, that this protein is 
isolated and purified from these cultures of host 
cells, and that this protein is used for the production 
of an anti-HVB vaccine. 

38. Process according to claim 1, characterized by
the fact that the host cells consist of bacteria of
Escherichia coli type, whose genome contains
neither the natural β-galactosidase gene, nor the
EBG gene, that is, Z⁻, EBG⁻ E. coli, and that these
transformed host cells are cultured in an X-gal
medium also containing the inducer IPTG, that
clones which are positive for β-galactosidase
function are detected in the culture milieu, that
thereafter this DNA is transplanted to a clone of host
cells appropriate for industrial production of at least
one peptide, polypeptide or protein with
β-galactosidase function.

39. Process according to claim 1, characterized by 
the fact that the said property is the capacity to bind 
to a given compound. 

40. Process according to claim 39, characterized 
by the fact that the said compound is chosen among 
the peptides, polypeptides and proteins. 
41. Process according to claim 40, characterized
by the fact that the said proteins are proteins
regulating the transcription activity or replication of
DNA.

42. Process according to claim 39, characterized
by the fact that the said compound is chosen among
the sequences of DNA and RNA. 

43. Proteins obtained by the process according to 
claim 40 or claim 42. 

44. Process for the production of DNA,
characterized by the fact that, in the same milieu,
genes which are at least partially composed of
stochastic synthetic polynucleotides are produced,
that the genes so produced are introduced into host
cells in a manner to produce an ensemble of
transformed host cells, that these are grown so as to
produce independent clones of the host cells so
produced, that screening and/or selection is carried
out on this ensemble to identify those host cells
which contain those stochastic sequences of DNA
having at least one desired property, and that such
DNA is isolated from the identified cultures of the
host cells.

45. Process according to claim 44, characterized
by the fact that the said property is the capacity to
bind to a given compound.

46. Process according to claim 45, characterized
by the fact that the said compound is chosen among
the peptides, polypeptides and proteins.

47. Process according to claim 45, characterized
by the fact that the said compound is a compound
regulating the transcription activity or the
replication of DNA.

48. Process according to claim 47, characterized
by the fact that the said compound is a regulatory
protein controlling the transcription or replication of
DNA.



49. Utilization of a sequence of DNA obtained by
the process according to claim 46, or claim 47, as a
cis-regulatory sequence [illegible]
transcription of a neighboring sequence of DNA.

50. Process according to claim 42, characterized
by the fact that the proteins obtained have the
capacity to modify the transcription activity, the
replication, or the stability of DNA.

51. Utilization of a protein obtained by the process
according to claim 48, to modify the transcription,
replication or stability of a sequence of DNA in a cell
containing this sequence of DNA and expressing
this protein.

52. Process for the production of RNA,
characterized by the fact that, in the same milieu,
genes which are at least partially composed of
synthetic stochastic polynucleotides are produced
simultaneously, that the genes thus obtained are
introduced in host cells in a manner to produce an
ensemble of transformed host cells, that the
independent clones of transformed host cells so
produced are grown simultaneously, that
screening and/or selection is carried out on this
ensemble to identify those host cells
containing stochastic sequences of RNA having at
least one desired property, and that the RNA is
isolated from the cultures of host cells so identified.

53. Process according to claim 52, characterized 
by the fact that the said property is the capacity to 
bind to a given compound.

54. Process according to claim 52, characterized
by the fact that the said property is the capacity to
catalyse a given chemical reaction.

55. Process according to claim 52, characterized
by the fact that the said property is to be a transfer
RNA. 